Interference-aware Scheduling for Data-processing Frameworks in Container-based Clusters

Author

  • Miguel G. Xavier
Abstract

With the emergence of data-processing frameworks such as Hadoop and Spark, a new kind of cluster resource manager became necessary to deliver per-application, container-wrapped, on-demand resources at large scale. The container-based "Big Data" operating system concept arose (e.g. YARN and Mesos) and brought with it the long-standing inter-instance performance interference issues of virtualization technologies. As a result, performance interference between co-located data-processing applications has become a source of uncertainty in container-based clusters, causing performance to fluctuate unpredictably and making performance guarantees likely to be violated. To work around this, an interference-aware scheduling algorithm is necessary to mitigate interference-related performance degradation and to schedule tasks on the best-suited compute nodes, that is, the nodes on which performance is maximized and the makespan is minimized.


Similar resources

Heterogeneity-aware scheduler for stream processing frameworks

This article discusses problems and decisions related to scheduling of stream processing applications in heterogeneous clusters. An overview of the current state of the art of the stream processing on heterogeneous clusters with a focus on resource allocation and scheduling is presented first. Then, common scheduling approaches of various stream processing frameworks are discussed and their lim...


Yard crane scheduling in port container terminals using genetic algorithm

The yard crane is an important resource in container terminals. Efficient utilization of the yard crane significantly improves the productivity and profitability of the container terminal. This paper presents a mixed integer programming model for the yard crane scheduling problem with a non-interference constraint, which is NP-hard in nature. In other words, one of the most important constraints in...


Hopper: Decentralized Speculation-aware Cluster Scheduling at Scale – Public Review

The huge volume of data available today has led to interest in parallel processing on commodity clusters. Data analytics distributed frameworks such as Hadoop, Spark, or Pregel are designed for parallel processing of a large amount of data. These frameworks break a computation job into small tasks that run in parallel on multiple machines, and aim to scale to very large clusters of inexpensive ...


Green Energy-aware task scheduling using the DVFS technique in Cloud Computing

Nowadays, energy consumption has become a critical issue in high-performance distributed computing systems, so green computing tries to reduce energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high-end computing can bring various benefits such as red...


Scheduling in Container Terminals using Network Simplex Algorithm

In the static scheduling problem, where the situation does not change, the challenge is to solve large problems in a short time. In this paper, the static scheduling problem of Automated Guided Vehicles in a container terminal is solved by the Network Simplex Algorithm (NSA). The algorithm is based on a graph model and its performance is at least 100 times faster than traditional si...




Publication date: 2016